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(54) Segment synthesis 

(57) The invention relates to the field of molecular 
recognition or detection of discontinuous or conforma- 
tional binding sites or epitopes corresponding to a bind- 
ing molecule, in particular in relation to protein-protein 
protein- nucleic acid, nucleic acid-nucleic acid orbiomol- 
ecule-ligand interactions. The invention provides a syn- 
thetic molecular library allowing testing for, identifica- 



tion, characterisation or detection of a discontinuous 
binding site capable of interacting with a binding mole- 
cule, said library having been provided with a plurality 
of molecules, each molecule of said molecules compris- 
ing at least one first segment linked to a second seg- 
ment, each segment having the capacity of being a po- 
tential single part of a discontinuous binding site. 
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Description 



[0001] The invention relates to the field of molecular 
recognition or detection of discontinuous or conforma- 
tional binding sites or epitopes corresponding to or in- 5 
teracting with a binding molecule, in particular in relation 
to protein-protein or protein-ligand interactions. 
[0002] interactions between binding molecules, 
which in general are biomolecules, and their corre- 
sponding ligands, are central to life. Cells often bear or 10 
contain receptor molecules that interact or bind with a 
hormone, a peptide, a drug, an antigen, an effector mol- 
ecule or with another receptor molecule; enzymes bind 
with their substrate; antibody molecules bind with an an- 
tigen, nucleic acid with protein, and so on. By "interact is 
or bind" it is meant that the binding molecule and ligand 
approach each other within the range of molecular forc- 
es, and may influence each others properties. This ap- 
proach takes the binding molecule and its ligand through 
various stages of molecular recognition comprising in- 20 
creasing degrees of intimacy and mutual effect: they 
bind. Binding molecules have this binding ability be- 
cause they comprise distinct binding sites allowing for 
the recognition of the ligand in question. The ligand, in 
turn, has a corresponding binding site, and only when 25 
the two binding sites can interact by - essentially spatial 
- complementarity, the two molecules can bind. Need- 
less to say that, molecules having three dimensions, 
binding sites are of a three dimensional nature, often 
one or more surface projections or protuberances of one 30 
binding site correspond to one or more pockets or de- 
pressions in the other, a three-dimensional lock-and-key 
arrangement, sometimes in an induced-fit variety. 
Sometimes, such a protuberance comprises a single 
loop of the molecule in question, and it is only this pro- 35 
tuberance that essentially forms the binding site. In that 
case one often terms these binding sites as comprising 
a linearorcontinuous binding site, wherein a mere linear 
part of the molecule in question is in essence responsi- 
ble for the binding interaction. This terminology in widely 40 
used to describe for example antibody-antigen reac- 
tions wherein the antigen comprises part of a protein se- 
quence, a linear peptide. One than often speaks about 
a linear or continuous epitope, whereby the binding site 
(epitope) of the antigenic molecule Is formed by a loop 45 
of consecutively bound amino acids. However, similar 
continuous binding sites (herein epitope and binding site 
are use interchangeably) can be found with receptor- 
antigen interactions (such as with a T-cell receptor), with 
receptor-ligand interactions such as with hormone re- so 
ceptors and agonists or antagonists thereof, with recep- 
tor-cytokine interactions or with for example enzyme- 
substrate or receptor-drug interactions, whereby a line- 
ar part of the molecule is recognised as the binding site, 
and so on. 55 
[0003] More often, however, such a protuberance or 
protuberances and depressions comprise various, dis- 
tinct parts of the molecule in question, and it are the 



combined parts that essentially form the binding site. 
Commonly, one names such a binding site comprising 
distinctparts of the molecule in question a discontinuous 
or conformational binding site or epitope. For example, 
binding sites laying on proteins having not only a primary 
structure (the amino acid sequence of the protein mol- 
ecule), but also secondary and tertiary structure (the 
folding of the molecule into alpha-helices or beta-sheets 
and its overall shape), and sometimes even quaternary 
structure (the interaction with other protein molecules) 
may comprise in their essential protuberances or de- 
pressions amino acids or short peptide sequences that 
lay far apart in the primary structure but are folded close- 
ly together in the binding site. 

[0004] Due to the central role binding molecules and 
their ligands play in life, there is an ever expanding in- 
terest in testing for or identification of the nature or char- 
acteristics of the binding site. Not only is one Interested 
in the exact nature of the particular interaction between 
binding molecule and ligand in question, for example in 
orderto replace or supplement binding molecules or lig- 
ands when needed; one is also interested in knowing 
approximating characteristics of the interaction, in order 
to find or design analogues, agonists, antagonists or 
other compounds mimicking a binding site or ligand in- 
volved. 

Versatile and rapid methods to test for or identify con- 
tinuous epitopes or binding sites are known. Most, if not 
all nucleic acid detection techniques, and molecular li- 
braries using these, entail hybridisation of an essentially 
continuous nucleic acid stretch with a complementary 
nucleic acid strand, be it DNA, RNA or PNA. Little atten- 
tion has been paid to methods allowing rapid and 
straightforward identification of discontinuous binding 
sites of an essentially nucleic acid nature. Although 
plenty of such sites exist, think only of the lack of under- 
standing surrounding ribosomal binding sites where ri- 
bosomal proteins bind to tRNA, of regulatory sites in pro- 
motor sequences, of interactions between polymerases 
and replicases between DNA and RNA, and so on, no 
molecular libraries exist that provide easy access to 
such sites. 

An early work in the peptide field is WO 84/03564, re- 
lated to a method of detecting or determining antigeni- 
cally active amino acid sequences or peptides in a pro- 
tein. This work, providing the so-called Pepscan tech- 
nology, whereby a plurality of different peptides is syn- 
thesised by linking with a peptide bond a first amino acid 
to a second, and so on, and on a second position in the 
test format yet another first amino acid is linked to a sec- 
ond, and so on, after which the synthesised peptides 
are each tested with the binding molecule in question, 
allows the determination of every continuous antigenic 
determinant or continuous epitope of importance in a 
protein or peptide sequence. Pepscan technology taken 
in a broad sense also provides for the testing for or iden- 
tification of (albeit linear) peptides essentially identical 
with, analogous to or mimicking binding sites or ligands 
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of a various nature (mimltopes, Geyssen at al, Mol. Im- 
munol. 23:709-715, 1986). 

Pepscan technology allows identification of linear pep- 
tide sequences interacting with receptor molecules, en- 
zymes, antibodies, and so on, in a rapid and straightfor- s 
ward fashion, allowing testing a great many peptides for 
their reactivity with the binding molecule in question with 
relatively little effort. The order of magnitude of testing 
capability having been developed with Pepscan tech- 
nology (e.g. also due to miniaturisation of test formats, 10 
see for example WO 93/09872) furthermore allows at 
random testing of a multiplicity of peptides, leading to 
automated combinatorial chemistry formats, wherein a 
great many of binding molecules are being tested in a 
(if so desired at random) pattern for their reactivity with is 
a molecular library of synthetic peptides representing 
potential continuous binding sites or ligands, allowing 
the rapid detection of particularly relevant molecules out 
of tens of thousands of combinations of molecules test- 
ed. 20 
However, for the testing of discontinuous or conforma- 
tional binding sites to a binding molecule, no formats 
similar to or as versatile as Pepscan technology exist. 
Attempts to identify discontinuous epitopes by Pepscan 
technology are cumbersome. It does in general not suf- 25 
fice to merely extend synthesis of the test peptides by 
linking more amino acids to the existing peptide, and 
hoping that some of the thus formed longer peptides will 
fold in such a way that at least two distinct parts are pre- 
sented in a discontinuous fashion and are recognised 30 
by a binding molecule. Than there is no way of finding 
out in a rapid and straightforward fashion that the bind- 
ing is indeed through a discontinuous binding site, it 
might be that just a longer single loop is responsible for 
the binding. 55 
[0005] Some additional possibilities are provided by 
testing synthetic peptide sequences that have been de- 
signed to comprise two previously identified parts of a 
binding site, each part in essence being linear and being 
part of a larger linear peptide. Early work herein was 40 
done by Atassi and Zablocki (J. Biol. Chem 252:8784, 
1977) who describe that spatially or conformationally 
contiguous surface residues (which are otherwise dis- 
tant in sequence) of an antigenic site of egg white lys- 
ozyme were linked by peptide bonds into a single pep- 
tide which does not exist in lysozyme but attempts to 
simulate a surface region of it. However their technique, 
called surface simulation synthesis, requires the de- 
tailed knowledge of the three-dimensional structure of 
the protein understudy and a full chemical identification so 
of the residues constituting the binding site at before- 
hand, as well as their accurate conformational spacing 
and directional requirements. 

In the same fashion, Dimarchi et al (Science 232: 
339-641 , 1 986) describe a 38 to 40 amino acid long syn- 55 
thetic peptide consisting of two previously identified sep- 
arate peptidyl regions of a virus coat protein. The pep^ 
tide was synthesised using common peptide synthesis 




technology (Merrifield et al., Biochemistry 21, 5020, 
1 982) by adding subsequent amino acids with a peptide 
bond to an ever growing peptide resulting in a peptide 
wherein the two peptidyl regions were connected by a 
diproline spacer presumably functioning as indication of 
a secondary structural turn, thereby thus providing a 
two-part epitope or binding site. 
However, it is clear that when one already at beforehand 
has to know the sequence of the (in this case only) two 
relevant parts, in order to provide the desired discontin- 
uous binding site, It excludes the feasibility to provide 
(desirably in a random fashion) a whole array of merely 
potential discontinuous binding sites for large scale test- 
ing. Furthermore, a major drawback of these strategies 
is that again only linear epitopes or dominant binding 
regions of discontinuous eptitopes can be mimicked ad- 
equately. For the complete synthesis of a discontinuous 
binding site, all the contributing parts have to be ar- 
ranged in the proper comformation to achieve high-af- 
finity binding, therefore, single parts of discontinuous 
binding sites have to be linked. 

Fifteen years after Dimarchi, Reineke et al (Nature Bio- 
technology, 17:271-275, 1999) provided a synthetic 
mimic of a discontinuous binding site on a cytokine and 
a method to find such a discontinuous binding site that 
allowed for some flexibility and somewhat larger scale 
of testing, wherein positionally addressable peptide col- 
lections derived from two separate regions of the cy- 
tokine were displayed on continuous cellulose mem- 
branes, and substituted in the process to find the best 
binding peptide. After selection of the "best reactors" 
from each region, these were combined to give rise to 
another synthetic peptide collection (comprising pep- 
tides named duotopes) that again underwent several 
rounds of substitutions. 

Reineke et al hereby provide synthesis of peptide chains 
comprising duotopes, however, again selected after 
previous identification of putative constituting parts with 
Pepscan technology, thereby still not allowing testing 
discontinuous binding sites in a rapid and straight for- 
ward fashion. However, as indicated before, protein do- 
mains or small molecules that mimic binding sites are 
playing an increasing role in drug discovery, diagnostics 
and biotechnology. The search for particular molecules 
that bind to a binding site and mimic, or antagonise the 
action of a natural ligand has been initiated in many lab- 
oratories. As indicated before, attempts to find such 
structures in synthetic molecular libraries often fail be- 
cause of the essentially discontinuous nature and spa- 
tial complementarity of most binding sites. Thus, for the 
many more cases where the binding site may essentially 
be discontinuous improved means and methods to iden- 
tify these sites are needed, and in particular means and 
methods are needed that allow testing for.discontinuous 
binding sites whereby said parts need not necessarily 
first be selected by previous identification as a putative 
or even only tentative constituting part of the desired dis- 
continuous binding site, but bear only the potentiality of 
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being part of that site by being a molecule with more or 
less distinct features per se. 

[0006] The invention provides a method for producing 
a molecular libra ry comprising providing said library with 
a plurality of molecules wherein said molecules have es- 5 
sentially been produced by segmental linkage, that is by 
linking (di-, tri, oligo- or multimeric) segments of for ex- 
ample nucleic acids or peptides, instead of by sequen- 
tially synthesising said molecules which is done tradi- 
tionally. The invention thus provides a molecular library 10 
that, albeit also suited for detecting or screening for con- 
tinuous binding sites, is now particularly well suited for 
detecting or screening for discontinuous binding sites, 
in particular in relation to binding molecule-ligand inter- 
actions such as for example protein-protein, protein-nu- is 
cleic acid, and nucleic acid-nucleic acid interactions. 
Said segments can of course be selected at random 
from any set of dl-, tri,- or oligomeric sequences, such 
as from di-, tri,- or oligonucleotides, or di-, tri-, or oli- 
gopeptides, but sometimes it may be preferred to in- 20 
elude at least one specific segment in said segment, 
specific in a sense that it has been selected from among 
known segments or distinct parts of biomolecules, such 
as parts of genes, proteins, enzymes, nucleic acids or 
unique fragments thereof, proteins involved in up- or 25 
downregulation of translation, t-RNAs, SNRP's, anti- 
bodies, antigens, receptors, transport proteins, tran- 
scription factors or factors involved in up- or downregu- 
lation of transcription, promoter sequences such as but 
not necessarily restricted to the well known TATA-box 30 
elements, repressor sites, operator sites and other con- 
trol elements, polymerases, replicases, in short, from 
among known segments or distinct parts of binding mol- 
ecules known or suspected to be involved in binding via 
a discontinuous binding site. Such known segments or 35 
parts thereof may of course be already known as parts 
constituting a discontinuous binding site, however, pre- 
vious identification as such is in essence not necessary, 
since screening for such sites with a molecular library 
according to the invention allows rapid and straightfor- 40 
ward identification of said constituting segments or parts 
thereof. 

Screening such a library can easily be envisioned when 
the library's molecules differ only in that constituting 
segments are chosen in an overlapping fashion, where- 45 
by a first segment from a distinct biomolecule is linked 
to a second, and to a third, and to a fourth segment, and 
a second is linked to a third, and to a fourth, and so on, 
if so required until all possible segments of said biomol- 
ecule have been linked two-by-two (or three-by-three, so 
or even more) together, which allows for a systematic 
screening of said biomolecule. However, linking in a 
overlapping fashion is of course not required, random 
segment links will provide valuable information about 
binding sites as well. 55 
The invention thus provides a method for producing a 
molecular library for identification or detection of a bind- 
ing site capable of interacting with a binding molecule, 



and thus for the identification of a molecule as a binding 
molecule, said method comprising providing said library 
with a plurality of molecules, further comprising gener- 
ating at least one of said molecules, preferably a greater 
part, most preferably essentially all of said molecules, 
by at least linking a first segment to a second segment, 
for example a segment which comprises a dimer, trimer, 
oligomer or multimer. 

Existing libraries, be it of for example nucleic acid (con- 
taining a repetitive back-bone of nucleotides, nucleo- 
sides or peptide nucleic acid, or combinations of these) 
or amino acid (containing a repetitive back-bone of ami- 
no acids) nature have in general in common that they 
comprise oligomeric or multimeric molecules, such as 
stretches of nucleic acids or amino acids, that have been 
produced by sequentially linking, in a repetitive fashion, 
one monomer (e.g. a nucleotide or an amino acid) to 
another, until a (in essence polymeric) molecule of the 
desired length has been obtained. Essentially, existing 
nucleic acid libraries comprise nucleic acids that are 
synthesised sequentially, by adding one nucleotide or 
nucleoside at a time to the growing stretch, and existing 
peptide libraries comprise peptides that are synthesised 
sequentially, by adding one amino acid at the time to a 
growing stretch, until the desired length has been 
reached. With nucleic acids said monomers are essen- 
tially selected from a limited set of well known nucle- 
otides, with peptides, said monomers are essentially se- 
lected from a well known set of amino acids. Not only 
naturally occurring monomers are used, synthetic nu- 
cleotides, such as peptide nucleic acid (PNA) mole- 
cules, or non-naturally occurring amino acids, or even 
D-amino acids, are routinely used as monomers by 
which the essentially polymeric molecules are generat- 
ed or produced, using a method that is essentially con^ 
form the sequential synthesis of polymers from mono- 
meric molecules in nature. 

[0007] The invention provides the recognition that es- 
sentially using dimeric or even larger (tri-, oligo-, or mul- 
timeric) segments, and thus stepping out of fashion with 
sequential nucleic acid or protein synthesis as it essen- 
tially occurs in nature, offers distinct advantages. It not 
only provides a faster method to arrive at a molecule 
composed of various segments, it also provides for fast 
and efficient shuffling of segments to generate a mole- 
cule repertoire for the desired library. The invention for 
example provides a method wherein synthesis is started 
with a monomer to which a second segment comprising 
a dimer, such as a dinucleotide or a dipeptide is added. 
Herein, a segment comprising a dimer at least consists 
of a dimer but can also be for example a trimer, or any 
other multimer, linking monomers of any nature, as re- 
quired. Of course, once two segments have been linked, 
further segments can be linked. 
In a preferred embodiment, to speed up further synthe- 
sis, orto be able to select distinct desired segments, the 
invention provides a method wherein said first segment 
also comprises a dimer, and in a yet even more pre- 
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ferred method, further segments comprise dimers as 
well. In a preferred embodiment, said dimer comprises 
a dinucleotide or dipeptide, but of course other dimers 
can be made also. The invention is further explained in 
the detailed description where several of the examples 5 
relate to libraries comprising molecules wherein each of 
said segments comprises a peptide, such as a tri-, a 
penta, an octa-, or nonapeptide; it is however also pro- 
vided by the invention to use segments of a varied na- 
ture, e.g. wherein one comprises a nucleic acid and an- 10 
other comprises a peptide, to better mimic binding sites 
that are for example found on nucleic acid-protein com- 
plexes. 

In a preferred embodiment, as for example shown in the 
examples, the invention provides a method wherein said 15 
first segment is linked by a thioether bond to said second 
segment, however, the invention is of course not limited 
thereto. Nucleotide/side segments can for example be 
covalently linked or ligated by splicing enzymes or iigas- 
es, or by overlapping a first segment and the second 20 
segment with an in essence relatively short nucleotide 
strand that is partly complementary to both segments. 
The invention thus provides a molecular library allowing 
testing for, identification, characterisation or detection 
of a continuous or discontinuous binding site capable of 2 ^ 
interacting with a binding molecule, said library having 
been provided with a plurality of molecules, each mole- 
cule of said molecules preferably comprising at least 
one first segment linked to a second segment, wherein 
at least said second-segment previously existed as dim- 30 
er or multimer. Preferably, each segment or part thereof 
having the capacity of being a potential single part of a 
discontinuous binding site, preferably wherein each of 
at least a first and a second segment or part thereof rep- 
resents a potential single part of a discontinuous binding 35 
site. Such a library can for example exist of a synthetic 
molecular library made by chemical linking of segments. 
Preferably, such segments have distinct features, for ex- 
ample by being in essence segments that are, comprise 
or mimic molecular components of living organisms, 40 
such as (combinations of) nucleotides, sugars, lipids, 
amino acids, nucleic acid molecules (DNA or RNA), 
peptide nucleic acid molecules (PNA), carbohydrates, 
fatty acids or fats. Herewith the invention provides syn- 
thesis of mojecules comprising separate segments po- <*5 
tentialiy representing at least two distinct parts of a dis- 
continuous binding site, said parts not necessarily first 
being selected after previous identification of potential 
constituting parts, thereby allowing testing for discontin- 
uous binding sites in a rapid and straight forward fash- so 
ion. 

The invention thus now allows identifying discontinuous, 
binding sites of receptor molecules that interact or bind 
at that contact site with a hormone, a peptide, a drug, 
an antigen, an effector molecule or with another recep^ 55 
tor molecule, of enzymes that bind with their substrate, 
of antibody molecules that bind with a binding site on an 
antigen, nucleic acid that binds with protein, and so on. 



In a preferred embodiment of the invention, at least one 
of said segments comprises a peptide, another segment 
being for example DNA, RNA, PNA, carbohydrate, a fat- 
ty acid, an hormone or an organic molecule altogether.. 
In one embodiment of the invention, all segments com- 
prise a peptide, said segments preferably linked by a 
stable (non-naturally) occurring non-peptide bond or 
linker. In this way a plurality of different peptides is syn- 
thesised by linking a first peptide segment to a second, 
and so on , and on a second position in the test or library 
format yet another first peptide segment is linked to a 
second, and so on, after which the synthesised peptides 
are each tested with the binding molecule in question, 
allowing the determination of a discontinuous antigenic 
determinant or discontinuous epitope of importance in 
a protein or peptide sequence. 

Said peptide segment comprises at least 2 amino acids, 
and can in principle be as long as desired, e.g. contain- 
ing a hundred amino acids or even more. In preferred 
practice, said peptide segment comprises from 3 to 30, 
preferably from 4 to 20, even more preferably from 5 or 
6 to 1 2 to 1 5 amino acids, such as 9 or 1 2 amino acids. 
Separate segments of course do not necessarily have 
to be of equal length. 

Furthermore, peptide segments to be linked together 
can be selected at random, or under guidance of (a) 
known protein or peptide sequence(s). Selection at ran- 
dom provides a random library according to the inven- 
tion. Selection from known protein or peptide sequences 
is for example useful when it is desired to find out wheth- 
er a discontinuous binding site is composed of distinct 
sites or parts present at distinct proteins or peptides, for 
example in a protein complex to which a particular bind- 
ing molecule can bind. Selection of various peptide seg- 
ments from one known protein or peptide sequence is 
useful when it is desired to find out whether a discontin- 
uous binding site is composed of distinct sites or parts 
present at one protein or peptide, for example at a folded 
protein to which a particular binding molecule can bind. 
Selection of peptide segments can be done by selecting 
overlapping peptides from such a known sequence. 
Overlapping peptides can have for example all but one 
or two amino acids in common, preferably overlapping 
in a contiguous fashion, or can overlap with only one or 
several amino acids. For a quick scan for discontinuous 
binding sites on a known protein, It is for example useful , 
to select nonapeptide segments from said protein se- 
quence, of which one has for example a 5-amino acid 
long overlap with another peptide segment. Equally use- 
ful, however, is to select tripeptide segments from said 
sequence having an overlap of only one amino acid, and 
use three, or even more segments in constructing the 
putative binding site molecule to which the to be tested 
binding molecule can bind. 

Of course, such selection strategies are equally appli- 
cable to segments of a different nature, nucleic acid seg- 
ments, comprising a certain number of . nucleotides, 
such as 5, 7, 9, and so on, can be selected from known 
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nucleic acid sequences comprising sought after discon- 
tinuous binding sites, each segment selected from a cer- 
tain position in said known nucleic acid sequence, if de- 
sired also in a overlapping fashion. Said nucleic acid 
segment comprises at least 2 nucleotides (be it DNA, 
RNA or PNA, or functional equivalents thereof), and can 
in principle be as long as desired, e.g. containing a hun- 
dred nucleotides or even more. In preferred practice, 
said nucleic segment comprises from 3 to 30, preferably 
from 4 to 20, even more preferably from 5 or 6 to 12 to 
15 nucleotides, such as 9 or 12 nucleotides. Separate 
segments of course do not necessarily have to be of 
equal length, and, as said before, can even be of a dif- 
ferent nature, e.g. peptide with DNA. 
Herein a peptide bond is being defined as an amide 
bond between an alpha-amino group of one amino acid 
or peptide and ah alpha-carboxyl group of another ami- 
no acid or peptide. A non-peptide bond comprises any 
other amide bond or non-amide bonds. The links or 
bonds can be formed using many, combinations of strat- 
egies of for example peptide or nucleotide chemistry 
and selective ligation reactions as known in the art. Li- 
gation chemistry has been published, for instance, by 
groups of Kent (Ph. E.Dawson et al., Synthesis of Pro- 
teins by Native Chemical Ligation, Science 266 (1994) 
776-779), Tarn (J.P.Tam et al., Peptide Synthesis using 
Unprotected Peptides through Orthogonal Coupling 
Methods, Proc. Natl. Acad. Sci. USA 92 (1995) 
12485-12489; C.F.Liu et al, Orthogonal Ligation of Un- 
protected Peptide Segments through Pseudoproline 
Formation for the Synthesis of HIV-1 Protease Analogs, 
J.Am.Chem.Soc. 118 (1996) 307-312; L.Zhang & J. P. 
Tarn Thiazolidone Formation as a General and Site-spe- 
cific Conjugation Method for Synthetic Peptides and 
Proteins, Analytical Biochemistry 233 (1996) 87-93), 
and Mutter (G.Tuchscherer & M. Mutter, Protein Design 
as a Challenge for Peptide Chemists, J. Peptide Science 
1 (1995) 3-10; S.E.Cervigni et al, Template-assisted 
Protein Design: Chimeric TASP by Chemoselective Li- 
gation, Peptides: Chemistry, Structure and Biology, P.T. 
P Kaumaya & R.S. Hodges eds, Mayflower (1996) 
555-557). 

Possible strategies for the formation of links as prefer- 
ably provided by the invention are for example are: 

1 . Said link with a segment or segments is formed 
using a homo- or hetero-bifunctional linking agent 
(S.S.Wong: Chemistry of Protein Conjugation and 
Cross-Linking, CRC Press Inc, Boca Raton, Florida 
USA 1991). In this construction a reactive group in 
one segment is used to react with one part of the 
bifunctional linking agent, thus facilitating the sec- 
ond part of the linking agent to react with a reactive 
group from a second segment. For instance, a linker 
like MBS (m-maleinimidobenzoic acid N-hydroxy- 
succinimide ester) can be used to react via its active 
ester (succinimide) with an amino group of one seg- 
ment and via its maleinimide group with a free thiol 



group from a second segment. In this strategy pref- 
erably no other free amino- or thiol groups should 
be present in the first segment and preferably no 
other free thiol groups are present in the second 
segment. In order to accomplish this, the amino or 
thiol groups that should be involved in the reaction 
can be deprotected selectively, for instance, by us- 
ing a side chain protecting group that can be 
cleaved by a mild reagent like 1% trifluoroacetic ac- 
id, which leaves other side chain protecting groups 
intact. 

2. Said link is formed by introduction of a modified 
amino acid in the synthesis of one or more seg- 
ments. Amino acids can be modified, for instance, 
by introduction of a special group at the side-chain 
or at the alpha-amino group. A modification at the 
alpha-amino group leads to an amide or backbone 
modified peptide (see fort example Gillon et al., Bi- 
opolymers, 31:745-750, 1991). For instance, this 
group can be a maleinimido group at the side chain 
amino group of lysine. At the end of the peptide syn- 
thesis this group will react fast and selective with a 
thiol group of a second segment. Tarn et al. (PNAS 
92:12485-12489, 1995) described a synthesis of a 
peptide with a lysine residue that was modified in 
the side chain with a protected serine residue. After 
deprotection and selective oxidation using perio- 
date, the alpha-amino, beta-hydroxy function of the 
serine was converted into an aldehyde function that 
could be ligated selectively with another thiol-bear- 
ing segment. Also peptide backbone links, via 
groups attached to the amide groups of the peptide, 
can be used to link segments (Bitan et al., J. Chem. 
Soc. Perkin Trans. 1:1 501 -1510, 1997; Bitan and 
Gilon,Tetrahedon, 51:1 051 3-1 0522, 1995; Kaljuste 
and Unden, Int. J. Pept. Prot. Res, 43:505-511, 
1994). 

3. Yet another way to form said link is to synthesise 
a segment, such as a peptide, with a modified N- 
terrninus. For instance, an N-terminal aipha-haloa- 
cetamido group can be introduced at the end of the 
synthesis. This group reacts fast and selectively 
with a second segment, i.e. another peptide, which 
contains a thiol group. For instance, the first seg^ 
ment is synthesised with an N-terminal bromba- 
cetamide and the second segment with a cysteine. 
Although most alpha-haloacetamide groups, like 
chloro-, bromb-, or iodoacetamide, will react with 
thiol groups, in those cases where speedy assem- 
bling is required, the bromoacetamide group is pre- 
ferred because of its ease of introduction and fast 
and selective reaction with thiol groups. 

Furthermore, the invention provides the possibility to ad- 
55 dress the link in every position of the first and/br the sec- 
ond or consecutive segment. For instance, for peptide 
segments sets of peptides are synthesised in which a 
cysteine or a side-chain modified lysine, both amino acid 
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residues being able to ligate selectively with another 
segment, shifts from the N-terminal amino acid position 
one by one to the C-terminal amino acid position. Com- 
binations of these possibilities will again lead to new li- 
braries as provided by the invention. 
In a preferred embodiment, the invention provides a li- 
brary wherein said molecules are positionally or spatial- 
ly addressable, e.g. in an array fashion, if desired aided 
by computer directed localisation and/or recognition of 
a specific molecule or set of molecules within the dimen- 
sions (e.g. plane or surface) of the support of the library 
used. In an array, said molecules are for example ad- 
dressable by their positions in a grid or matrix. 
A preferred embodiment of the invention further allows 
upscaling of the synthesis concerning the number of 
constructs on for example a solid support per cm 2 . To- 
facilitate generation of a great many possible con- 
structs, containing for example molecules comprising at 
least two peptide segments of a protein, many thou- 
sands of peptide constructs are made. For instance, 
when all constructs, in which both segments are for in- 
stance 12 amino acids long, are derived from a small 
protein with a length of 100 amino acid residues are 
needed, already 89 x 89 = 7921 peptide constructs are 
made, if the segments are only linked, for instance, via 
the C-terminus of the first segment and the N-terminus 
of the second segment using only one type of link. For 
a protein with a length of 1000 amino acid residues at 
least 989 x 989 = 978121 constructs are made. For ef- 
ficient ELISA testing of these numbers of constructs, 
high construct densities on the solid support are pre- 
ferred. High densities of constructs on a solid support 
are provided by the invention, wherein for instance, (a 
layer of) a first segment with a bromoacetamide group 
at the N-terminus is synthesised on a surface of, for in- 
stance, 1 cm 2 . On this peptide-functionalised surface of 
the support a set of, for instance, 1 0, preferably 50, pref- 
erably 100, or more second peptide segments contain- 
ing a free thiol group are spotted or gridded, in a posi- 
tionally or spatially addressable way, giving, after cou- 
pling, so many different peptide constructs. Spotting 
can, for instance, be done using piezo drop-on-demand 
technology, or by using miniature solenoid valves. Grid- 
ding can, for instance, be done using a set of individual 
needles that pick up sub-microliter amounts of segment 
solution from a microtlter plate, containing solutions 
comprising the second segments. After the Jinking reac- 
tion, subsequent deprotection and extensive washing of 
the support to remove uncoupled peptide gives at least 
a peptide construct density as large as 1 0 to 50, or even 
1 00 to 200, or up to 50 to 1 000 spots per cm 2 . This den- 
sity allows to screen a great many possible peptide con- 
structs of said proteins for binding with an antibody. For 
example: in a preferred embodiment 20000 to 100.000 
constructs are made on 1 000 cm 2 , typically the surface 
is than screened for binding in ELISA with 1 00 ml of an- 
tibody solution,. containing 1 - 10 jig of antibody/ml. For 
example, indirect or direct fluorescence detection allo- 



cates antibody binding constructs. Direct fluorescence 
detection with confocal scanning detection methods for 
example allows antibody detection on spots generated 
with droplets peptide-solution in the sub-nanoliter 
5 range, making even higher construct densities feasible. 
Of course, nucleic acid libraries can be made in a similar 
fashion. 

Furthermore, the invention provides a solid support 
comprising a library according to the invention, said sol- 

10 id support allowing presentation of a potential discontin- 
uous or conformational binding site or epitope to a bind- 
ing molecule, said solid support having been provided 
with a plurality of molecules, each molecule of said mol- 
ecules being a possible representative of said binding 

is site or epitope and for example comprising at least one 
first peptide or nucleotide covalently linked by a spacer 
to a second peptide or nucleotide, said spacer compris- 
ing at least one non-peptide linkage. 
In a preferred embodiment, said solid support compris- 

20 es at least a spot or dot (e.g. putative binding site or 
peptide construct) density as large as 10, 20, or 50, or 
even 1 00, 200, or up to 500 or even 1 000 spots per cm 2 ' 
preferably wherein said spots or dots are positionally or 
spatially addressable. 

25 The invention further provides a method to screen for, i. 
e. test, identify, characterise or detect a discontinuous 
binding site capable of interacting with a binding mole- 
cule, comprising screening a library as provided by the 
invention with binding molecules, such as there are for 

30 instance antibodies, soluble receptors, which contain a 
Fc-tail or a tag for detection, receptors on cells, bioti- 
nylated molecules or fluorescent molecules. Alternative 
segments could comprise, for instance, carbohydrates, 
non-natural amino acids, PNA's, DNA's, lipids, mole- 

35 cules containing peptide bond mimetics. In particular, 
the invention provides a method to screen for a discon- 
tinuous binding site capable of interacting with a binding 
molecule, comprising screening a library according to 
the invention with at least one binding molecule and de- 

40 tecting binding between a member of said library and 
said binding molecule. In a preferred embodiment, said 
binding is detected immunologically, for example by ELI- 
SA techniques. 

By detecting binding to a specific member of said library, 
45 the invention provides said member, a synthetic mole- 
cule comprising a discontinuous binding site Identifiable 
or identified or obtainable or obtained by a method ac- 
cording to the invention. Thus the invention provides use 
of a library according to the invention, use of a solid sup- 
so port according to the invention, or use of a method ac^ 
cording to the invention for identifying or obtaining a syn^ 
thetic molecule comprising a discontinuous binding site 
or a binding molecule capable of binding therewith. Be- 
cause now discontinuous binding sites are provided, 
55 such a synthetic molecule can advantageously be used 
in vitro or in vivo for finding a binding molecule, and for 
effecting and/or affecting binding to a binding molecule, 
for example to interact or bind with receptor molecules 
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that normally interact or bind with a hormone, a peptide, 
a drug, an antigen, with an effector molecule, with an 
agonist, with an antagonist, or with another receptor 
molecule; with enzymes that normally bind with their 
substrate; with antibody molecules, with nucleic acid, 5 
with protein, in short, with biomolecules. The invention 
is further explained in the detailed description without 
limiting the invention. 

Figure legends. 10 

[0008] Fig. 1. ELISA results of a library of constructs 
of a protein, synthesised in 3 jii wells of a 455 wells mi- 
crotiter plate tested against a protein specific mono- 
clonal antibody. Construct 1: sequenced -11]-Cys cou- 15 
pled to bromoacetamide-sequence[14-25]; construct 2: 
sequence[2-12]-Cys coupled to bromoacetamide-se^ 
quence[15-26]; and so on. 

[0009] Fig. 2. Microturisation of spots of peptide con- 
structs. Constructs were tested against the same mon- 20 
ocional antibody as was tested in figure 1 . Binding was 
made visible using indirect fluorescence detection. Pep- 
tide 1, sequence [139- 150] with an N-terminal bromoa- 
cetamide, was synthesised on the complete surface. 
Peptide construct 6 - 1 is the same as construct 125 in 25 
figure 1 . Peptides 2 up to 8, containing a cysteine resi- 
due, were spotted in different volumes ranging from 1 
jil to 0.25 nl using piezo drop-on-demand technology. 
[0010] Fig. 3. Schematic presentation of a two. seg- 
ment scan. 12345678901 (segment 2) and 30 
NOPQRSTUVWXY (segment 1) represent successive 
sequences derived from a protein. Both segments are 
linked via a thioether bridge, formed by reaction of a free 
thiol function of a C-terminal cysteine residue (C) in seg- 
ment 2 and a bromoacetamide group ($) at the N-termi- 35 
nus of segment 1, In this scan both sequences can be 
shifted simultanuously by steps of one amino acid resi- 
due through the complete protein sequence to obtain 
the library. SS = Solid Support. 

[0011] Fig. 4. Schematic presentation of a complete 40 
matrix scan. 12345678901 and ABCDEFGHIJK repre- 
sent sequences derived from a protein. Both sequences 
are linked as described in fig. 3. In this scan both se- 
quences are shifted independently through the com- 
plete protein sequence, generating a library of all pos- 45 
sible sequence combinations. 

[0012] Fig. 5. Schematic presentation of a complete 
matrix scan. This scan is similar to the scan shown in 
fig. 4 : however, the cysteine residue is positioned at the 
N-termi nus of the second segment, leading to a re- so 
versed of both segments. 

[0013] Fig. 6. Schematic presentation of a positional 
complete matrix scan. This scan is similar to the scans 
shown in figs. 4 and 5, however, no cysteine residue was 
added to one of the termini of the second segment, but 55 
instead each residue of the second segment sequence 
was substituted one by one by a cysteine residue. 
[0014] Fig. 7. Schematic presentation of a multi seg- 
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ment scan. 12345678901 (segment 1), NOPQRSTU- 
VWXY (segment 2) and 

[0015] BCDEFGHIJKLM (segment 3) represent suc- 
cessive sequences derived from a protein. Segments 
were linked via a thioether bridge, formed by reaction of 
a free thiol function of a C-terminal cysteine residue (C) 
in segment 1 and a bromoacetamide group ($) at the N- 
terminus of segment 2 and so on, as described in ex- 
ample 3. in this scan all sequences are subsequently 
shifted simultanuously through the complete protein se- 
quence to obtain the complete library. 
[0016] Fig. 8. Schematic presentation of a scan of in- 
teracting proteins 1, 2 and 3. ABCDEFGHIJK and AB- 
CDEFGHIJK represent two independent sequences 
from two different proteins combined in one construct. 
A: a matrix scan of these segments was tested against 
a complete, labelled third protein. B: a matrix scan of 
segments from HIV proteins and the CCR5 protein was 
tested against a labelled soluble CD4 protein. 
[0017] Fig. 9. Schematic presentation of a matrix 
comb i -scan with a complete protein. The constructs are 
scanned with a another labelled protein in solution. 
[0018] Fig. 1 0. Schematic presentation of a DNA/RNA 
scan. The constructs are scanned with a labelled pro- 
tein, for example a regulatory protein, or another DNA 
or PNA strand. 

[0019] Fig. 1 1 . Schematic presentation of a PNA/DNA 
test. The upper part shows the principle of a test. A 
cysteine modified PNA was reacted with a bro- 
moacetylated solid support, the bound PNA was tested 
against a biotin labelled DNA strand. In an ELISA sys- 
tem the labelled DNA was detected using HRPO la- 
belled streptavidine. The lower part of the figure shows 
the reactivity in ELISA of the PNA with respectively a 
complementary DNA strand, a complementary strand 
with a single mutation, an unrelated DNA strand and a 
buffer control. SS = solid support. 
[0020] Fig. 12. Schematic presentation of an intracel- 
lular protein scan. 

Detailed description 

SYNTHESIS OF PEPTIDE CONSTRUCTS 

[0021] A peptide with a N -terminal bromoacetamide 
group was synthesised at the surface of a solid support 
containing free amino groups. The peptide still con- 
tained the side-chain protecting groups of the amino ac^ 
id residues. A second peptide containing a cysteine res- 
idue, which was deprotected and cleaved from another 
solid support was reacted with the bromoacetamide 
peptide on the first solid support. The formed construct 
was deprotected, but not cleaved from the support, and 
could be used in ELISA. 

A polypropylene or polyethylene support, or of other 
suitable material, was grafted with, for instance, poly- 
acrylic acid. As an example: a polypropylene support in 
a 6 % acrylic acid solution in water, containing CuS0 4 , 
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was irradiated using gamma radiation at a dose of 12 
kGy. The grafted solid support containing carboxylic ac- 
id groups was functionalised with amino groups via cou- 
pling of t-butyloxycarbonyl-hexamethylenediamine 
(Boc-HMDA) using dicyclohexylcarbodiimide (DCC) 
with N-hydroxybenzotriazole (HOBt) and subsequent 
cleavage of the Boc groups using trifluoroacetic acid. 
Standard Fmoc peptide synthesis was used to synthe- 
sise peptides on the amino functionalised solid support. 
After cleavage of the Fmoc group of the last amino acid 
and washing, bromoacetic acid was coupled using DCC 
or DCC/HOBt. If only DCC was used the peptide did 
contain a thiol reactive bromoacetamide group, howev- 
er, if DCC/HOBt was used to couple bromoacetic acid, 
the peptide essentially did not contain the bromo group, 
but another reactive group capable to react efficiently 
with thiol groups thus forming the same thloether link 
between the segments. 

[0022] Coupling/ligation of a second peptide to a pep- 
tide synthesised on a solid support: 
Peptides were synthesised at polyethylene pins grafted 
with poly-hydromethylmethacrylate (poly-HEMA). This 
graft polymer was made by gamma irradiation of poly- 
ethylene pins in a 20% HEMA solution in methanol/wa- 
ter 80/20 or 70/30 at a dose of 30-50 kGy. The function- 
alised support can be used for the synthesis of 1 jxmol 
of peptide/cm 2 after coupling of p-alanine and an acid 
labile Fmoc-2,4-dimethoxy-44carooxymethyloxy)-ben- 
zhydrylamine (Rink) linker. The peptides were synthe- 
sised using standard Fmoc chemistry and the peptide 
was deprotected and cleaved from the resin using trif- 
luoroacetic acid with scavengers. 
The cleaved peptide containing a cysteine residue at a 
concentration of about 1 mg/ml was reacted with the sol- 
id support bound peptide described above in a water/ 
sodium bicarbonate buffer at about pH 7-8, thus forming 
a partially protected construct of two peptides covalently 
bound via a thioether and C-terminally bound to the solid 
support. 

The construct described above was deprotected follow- 
ing standard procedures using trifluoroacetic acid/scav- 
enger combinations. The deprotected constructs on the 
solid support were extensively washed using disrupting 
buffers, containing sodium dodecylsulphate and p-mer- 
captoethanol, and ultrasonic cleaning and were used di- 
rectly in ELISA. Subsequent cleaning in the disrupt buff- 
ers allows repeatingly testing against other antibodies 
in ELISA. 

[0023] Figure 1 shows an example of the ELISA re- 
sults of screening a simple library of constructs, consist- 
ing of a dodecapeptide segment coupled via its C-ter- 
minally added cysteine residue to a N-terminaily bro- 
moacetylated second segment, scanning a protein se- 
quence by steps of a single amino acid residue. The bro- 
moacetamide peptide was covalently bound to a func- 
tionalised polypropylene/polyacrylic acid solid support 
in 3 u.l wells as described above. The cysteine-contain- 
ing sequences were synthesised on and cleaved from 



functionalised polyethylene pins as described above. As 
shown in figure 1, high binding was observed in ELISA 
for constructs around position 125, which consists of the 
segments [125-136] and [139-150], linked via a 

5 thioether bond. A conventional linear PEPSCAN of do- 
decapeptides or 15-peptides did not show any binding 
in a reaction against the same monoclonal antibody. On 
a surface of a solid support peptides are synthesized 
with a bromoacetamide group at the N-terminus as de- 

w scribed above. On this peptide functionalized support a 
second peptide segment containing a free thiol group 
was spotted using piezo drop-on-demand technology, 
using a microdosing apparatus and piezo autopipette 
(Auto Drop-Micropipette AD-K-501) (Microdrop Ges- 

*5 ellschaft fur Mikrodosier Systeme GmbH. Alternatively, 
spotting or gridding was done using miniature solenoid 
valves (INKX 0502600A; the Ice Company) or hardened 
precision ground gridding pins (Genomic Solutions, di- 
ameters 0.4, 0.6, 0.8 or 1 .5 mm). Subsequent deprotec- 

20 tion of the construct and extensive washing to remove 
uncoupled peptide gave dipeptide constructs at the 
spotted area. 

Figure 2 shows binding of the same antibody as was 
tested in figure 1 with constructs consisting of two pep- 

25 tide segments, generated with different volumes of spot- 
ted peptides 2 to 8, ranging from 1 jjlI - 0.25 nl (x-axis). 
Within the square the whole surface was covered with 
peptide 1 , which was synthesised directly on this sur- 
face, only the spots contain constructs. The y-axis 

30 shows different constructs, consisting of peptide 1 with 
peptide 2 up to 8. Peptides 2 up to 8 are overlapping 
dodecapeptides, while peptide 1 is sequence [1 39-1 50] 
of the same protein as described in figure 1. Figure 2 
shows that peptide constructs generated with peptide 

35 solution dropiets in the nanoliter-range, bind enough an- 
tibody for detection, in this case using indirect fluores- 
cence detection. Spots generated with 0.25 nl - 50 nl 
aresmallerthan 1 mm 2 . Thus, peptide construct density 
can be as large as 100-1000 spots per cm 2 . 

40 

Examples of use 

[0024] Proteins and peptides can be screened using 
any type of binding molecule, e.g. biomolecules such as 
45 antibodies, soluble receptors, which contain a Fc-tall or 
a tag for detection, biotlnylated molecules or fluorescent 
molecules. Alternative segments could be, for instance, 
carbohydrates, non-natural amino acids, PNA's, DNA's, 
lipids, molecules containing peptide bond rnimetics. In 
so the examples $ is used as a symbol for a thioether link 
formed by reaction of the thiol group of a cysteine resi- 
due of one segment with a bromoacetamide group from 
another segment. This symbol can also be used for oth- 
er linking chemistries as described, and thus describes 
55 . links between segments. 
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Example- 1: 'standard' 24-30-mer scan of linear 
sequence, containing two segments. 

[0025] In this example the consecutive sequences of 
the segments are both shifted one by one residue 
through the sequence of the protein to be tested as 
shown in figure 3. The -C$- link between both segments 
replaces 0 - 2 or more amino acid residues of the native 
protein sequence. Applications include replacement 
sets of peptides, in which amino acid residues are re- 
placed systematically by other amino acid residues, de- 
letion sets of peptides, in which amino acid residues are 
deleted systematically, and combination sets, in which 
peptides of different lenght ranging from 2 - 24 (here 
segment 2, 2-40 or more and segment 1 , 2-1 5 or more) 
amino acid residues are used. 

Example-2a: tail-to-head matrix-scan. 

[0026] In a complete matrix-scan the N-terminal se- 
quence of, for instance s sequence [1 - 11] of a protein, 
is linked as a segment with each overlapping peptide 
sequence of a complete scan of the same protein as 
shown in figure 4. Next, sequence [2-12] is linked with 
the same set of overlapping sequences and so on. The 
link can be formed, for instance, by reaction of a cysteine 
at the C-terminus of the second segment with a bromoa- 
cetamide modified N-terminus of the first segment. This 
means that every combination of, for instance, unde- 
capeptides from the protein sequence is being synthe- 
sised on a separate, known, position of the solid sup- 
port. 

Example-2b: head-to-head matrix-scan. 

[0027] This is the same scan as the complete matrix 
scan from example 2a, however, in this scan the 
cysteine of the second segment is located at its N-ter- 
minus, providing a reversed orientation of both seg- 
ments in the construct as shown in figure 5. 

Example-2c: Positional matrix-scan with cysteine at ALL 
positions. 

[0028] This is a scan similar to the complete matrix 
scans described above, however, in this scan the 
cysteine is used to substitute the amino acid residues 
one by one in every position of the second segment as 
shown in figure 6. 

Example 2d: tail-to-taii or head-to-tail matrix scan. 

[0029] Tail-to-tail or head-to-tall matrix scans can be 
obtained if for example a cysteine is incorporated a C- 
terminal amino acid or a first segment on the solid sup- 
port. A second segment is added either as N-terminal 
bromoacetylated peptide or as C-terminal side chain 
bromoacetylated lysyl peptide. 
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Example-3: Multi-segment scan. 

[0030] In this example a thiol fuction is introduced on 
an amino-functionalised solid support. This can be 

5 made by a direct reaction of the amino groups with, for 
instance, imtnothiolane, or by coupling of Fmoc-Cys 
(Trt)-OH, followed by Fmoc cleavage using piperidine, 
acetylation, andtrityl deprotection using TFA/scavenger 
mixtures. This thiol-functionalised solid support can be 

10 reacted with, for instance, a bromoacetamide-peptide, 
containing a protected cysteine residue. After coupling 
of the first peptide, the cysteine can be deprotected, us- 
ing, for instance, a TFA/scavenger mixture. The formed 
free thiol group can be used to couple a second bromoa- 

is cetamide-peptide, again containing a protected 
cysteine. This procedure can be repeated to make multi- 
segment constructs. Several types of scans, as de- 
scribed in the other examples, can be used in combina- 
tion with this multi-segment scan. In fig. 7 an example 

20 is shown for a multi-segment scan. 

Example-4: Matrix combi-scan, interaction between 
three (or more) proteins 

25 [0031] In a matrix combj-scari, a matrix scan from two 
different proteins is tested against a labelled soluble pro- 
tein. Figure 8 shows two examples. In the first example 
(fig. 8A) soluble protein 1 (growth hormone, GH) was 
tested against a combined matrix scan of protein 2 (GH- 

30 receptor-1) and protein-3 (GH-receptor-2). In the sec- 
ond example a part of soluble protein 1 (CD-4) was test- 
ed against a combined matrix scan of protein 2 (HIV) 
and protein 3 (ch em okine- receptor CCR4). 
[0032] Examples 1 to 4 describe methods using pep- 

35 tide segments and screening with proteins. These con- 
structs can also be screened against non-proteins. Also 
non-peptide segments can be used. Below, examples 
of whole proteins in combination with peptides (example 
5), or peptides/proteins in combination with non-pep- 

40 tide/proteins, or non-peptide/protein with non-peptide/ 
protein (example 6, DNA/RNA/PNA) are shown. 

Example-5: Matrix combi-scan, interaction between 
three (or more) proteins, 

45 

[0033] This example Is similar to example 4. The dif- 
ference is that the segment 2 sequences, derived from 
one protein (ABCDEFGHUKC etc.) are replaced by a 
complete protein, which contains an added thiol group 
50 for coupling (see fig 9). 

Example-6 : scans of linear DNA/PNA/peptide with 
DNAfrNA/peptide 

55 [0034] This example is similar to that of examples 1 
to 5 with the difference that one or more other non-pep- 
tide segments are used (DNA, RNA or a peptide nucleic 
acid (PNA) instead of a peptide segment). The nude- 
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otide segments or PNA's are modified with groups that 
enable linking of the segments as in examples 1 to 5. 
Screening is performed with labelled DNA strands, pep- 
tides or proteins (see fig 1 0 or 1 1 ). As alternative labelled 
DNA or PNA strands can also be tested against peptide 5 
construct described in examples 1,to 5. 
[0035] In addition to scanning interaction regions of 
proteins and non-proteins (DNA/RNA) in ELISA, chip or 
blot format it is also possible to use the -C$- coupling in 
in vitro bioassays. Firstly, it is possible to use soluble 10 
constructs as explained in example-3 as potential (ant) 
agonists for membrane bound receptors. Secondly, it is 
possible to use membrane-transporting proteins such 
as transportan or penetratin to get any of the above 
mentioned combinations of peptides or peptides with 15 
PNA or peptides with (small) proteins into the cell. Below 
two examples are shown. 

! Example-7a: Intracellular protein scan coupled to 

membrane-penetrating Transportan. 20 

[0036] An intracellular protein, like a kinase, can be 
scanned using overlapping peptides on a solid support, 
containing a C-terminal cleavable linker. The peptides 
were synthesised with a N-terminal bromoacetamide 25 
group. Next, a membrane penetrating transportan pep- 
tide, containing a label and a thiol group was coupled 
with the sequences. These constructs were selectively 
cleaved from the solid support and tested in a bioassay. 
Labels that can be used are, for instance, biotine or flu- 30 
orescent labels (figure 12). 

Example-7b: Intracellular RNA, DNA or PNA-scan 
coupied to membrane-penetrating Transportan. 

35 

[0037] To identify PNA/DNA sequences that can be 
used to block expression of a particular gene a long lin- 
ear scan, coupled to membrane penetrating peptide, 
can be tested in an in vitro bio-assay. 



Claims 

1 . A method for producing a molecular library for iden- 
tification or detection of a binding site comprising 45 
providing said library with a plurality of molecules, 
further comprising generating at least one of said 
molecules by at least linking a first segment to a 
second segment. 

so 

2. A method according to claim 1 wherein said first or 
second segment comprises at least a dimer. 

3. A method according to claim 1 or 2 wherein said 
dimer comprises a dinucleotide or dipeptide. 55 

4. A method according to anyone of claims 1 to 3 
wherein each of said segments comprise a peptide. 




5. A method according to anyone of claims 1 to 4 
wherein said first segment is linked by a thioether 
bond to said second segment. 

6. A method according to anyone of claims 1 to 5 
wherein each of at least a first and/or a second seg- 
ment or part thereof represents a potential part of a 
discontinuous binding site. 

7. A library comprising a plurality of molecules com- 
prising at least a first and a second segment obtain- 
able by a method according to anyone of claims 1 
to 6. 

8. A library according to claim 7 wherein said mole- 
cules are positionally or spatially addressable. 

9. A library according to claim 7 or 8 wherein each of 
at least a first and/or a second segment or part 
thereof represents a potential part of a discontinu- 
ous binding site. 

1 0. A solid support comprising a library according to an- 
yone of claims 7 to 9. 

1 1 . A method to screen for a binding site capable of in- 
teracting with a binding molecule, comprising 
screening a library according to anyone of claims 7 
to 9 with at least one potential binding molecule and 
detecting binding between a member of said library 
and said potential binding molecule. 

12. A method according to claim 11 wherein said bind- 
ing site is a discontinuous binding site. 

1 3. A synthetic molecule comprising a binding site iden- 
tifiable or obtainable by a method according to claim 
11 or 12. 

14. A binding molecule comprising a binding site iden- 
tifiable or obtainable by a method according to claim 
11 or 12. 

1 5. A molecule according to claim 1 3 or 1 4 wherein said 
binding site comprises a discontinuous binding site. 

16. Use of a library according to anyone of claims 7 to 
9, a solid support according to claim 1 0, or a method 
according to claim 11 or 1 2 for identifying or obtain- 
ing a synthetic molecule comprising a binding site. 

17. Use of a library according to anyone of claims 7 to 
9, a solid support according to claim 1 0, or a method 
according to claim 11 or 1 2 for identifying or obtain- 
ing a binding molecule capable of binding to a bind- 
ing site. 

18. Use of a molecule according to claim 13, 14 or 15 
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for interfering with or effecting binding to a binding 
molecule. 
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